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Contextuality is the leading notion of non-classicality for a single system. However, experimental 
demonstrations of contextuality have generally been limited by conceptual difficulties. Here I focus 
on the simplest non-trivial case, four preparations and two tomographically complete binary mea¬ 
surements. Exploiting a subtle connection to the CHSH scenario gives eight non-linear inequalities 
which are together necessary and sufficient for the experimental statistics to admit a preparation 
noncontextual model. No fixed “operational equivalences” are required. Violating such an inequality 
is therefore arguably the most direct route to an experimental refutation of noncontextuality. 


The gold standard for an experiment that defies classi¬ 
cal explanation is the violation of a Bell inequality m®. 
In the case of a single quantum system, this is not a pos¬ 
sibility, and so attention has focussed on contextuality. 

Contextuality was first identified by Bell, Kochen and 
Specker 0 SI- Whilst this was a profound insight 
into quantum mechanics, the definition they used is not 
amenable to principled experimental test (see mm and 
references therein). The definition was later refined and 
generalised by Spekkens [7j. As shown in [5] , this defini¬ 
tion, or even just one component of it known as prepara¬ 
tion noncontextuality , is more suited to experiment. 

However, [8| used an assumption that two preparation 
procedures were indistinguishable, which was not satis¬ 
fied exactly in the reported experiment and never will be 
in any experiment. Here I show how this problem can 
be eliminated by providing a full characterisation of the 
preparation noncontextual statistics in the simplest sce¬ 
nario to which the concept applies. The shift in approach 
is that, with the help of tomographically complete mea¬ 
surements, indistinguishable preparation procedures are 
inferred from the statistics, rather than posited a priori. 

Definitions. Consider an experiment where one im¬ 
plements a preparation procedure V t followed by a mea¬ 
surement procedure A4j with outcome k , characterised 
by the probabilities P{k\Pi, M.j). An ontological model 
seeks to explain these results via an ontic state A that 
screens off the preparation from the measurement result: 

P(k\Vi, Mj) = J P(k\X, Mj)pi(X)dX, (1) 

where we use the shorthand pfi A) = P(\\Vi). 

The explanation proffered by an ontological model is 
compelling only if it does justice to important features of 
the observed statistics. For example, in a bipartite sce¬ 
nario, Bell’s locally causal models would provide a natu¬ 
ral explanation for the observed no-signalling [Sj. 

Preparation noncontextuality concerns a closely re¬ 
lated feature, namely operational equivalence among mix¬ 
tures of preparations. Suppose there exists probability 
distributions {pi} and {g.;} such that for all j, k 

Y^PiPWVuMj) = Y J dr p {k\P l ,M J ). ( 2 ) 

i i 


Then we say that the probabilistic mixtures, which 
might be written YliPi'P* an d QiPu are operationally 
equivalent. In principle it is possible that somebody 
invents a new measurement procedure M! such that 
YjiPiP(k\Vi, M') ^ QiP{k\Pi, M'), in which case the 
apparent operational equivalence would evaporate. For 
the time being we will assume no such A4' exists. In 
the language of quantum theory, where preparations Vi 
are associated with density operators pi, we are assuming 
that YjiPiPi = Yhi QiPi- This follows from eq. |2| if and 
only if the POVMs associated with the A ij are tomo¬ 
graphically complete [lD] or fiducial m for the density 
operators spanned by the pt. This crucial issue will be 
returned to later. 

What can explain the inability of any measurement 
to distinguish '^2 i PiPi from )>A QjVfi The most natural 
explanation is that this “distinction without a difference” 
is no distinction at all: 

^2 ( A ) = qiixi ■ ( 3 ) 
i i 

The inference from the operational equivalence © to the 
ontic equivalence ([3]) constitutes the assumption of prepa¬ 
ration noncontextuality [7J. (For comparison, measure¬ 
ment noncontextuality is the assumption that measure¬ 
ments that cannot be distinguished by the statistics for 
any preparation are equivalent in the ontological model.) 

The simplest scenario. Suppose we consider only 
a single measurement AJi. Then the simple “A = k" 
model P(k\X,Mi) = Sk\, Pi{X) = P(k = \\Vi,M{) is 
manifestly preparation noncontextual. Hence we need 
to consider at least two measurements, the simplest case 
being two binary (two-outconre) measurements. 

In this scenario the operational probabilities for a sin¬ 
gle preparation Vi are given by the 2-dimensional real 
vector ^ = (P(0\Vi, M 0 ) - P(1\V U M 0 ),P(0\Vi,Mi)- 
P(l\Pi,Aii)) which (along with normalisation) fixes all 
4 probabilities. Hence eq. © has the geometric form 
^ ‘ZiPi'Pi = QiPi- The convex hull of any three Vi is a 
simplex: a point, a line segment, or (the generic case) a 
triangle. Consider the ontological model in which there is 
an ontic state for each extreme point of the simplex, with 
Pi( A) = 1 for the one or more V t located at the extreme 
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FIG. 1. Example statistics. The preparations have been la¬ 
belled in accordance with the conventions that Vo is opposite 
Vz and {Vo — Vz,V 2 — Vi} is positively oriented. Also shown 
is c as defined by eq. §. 


point A, and p(k |A, Mi) defined by the coordinates of the 
extreme point in the obvious way. Expanding eq. <[2j) as 

= (4) 

i ,A i ,A 

the fact that every point in a simplex has a unique convex 
decomposition into extremal points immediately gives 
eq. ©> so our model is preparation noncontextual. 

Hence the simplest scenario in which one can demon¬ 
strate noncontextuality, and the focus of most of this ar¬ 
ticle, is four preparations and two binary measurements. 

Labelling conventions. Denoting the four prepa¬ 
rations {Po,Vi,V 2 ,V 3 }, the Vi = ( Xi,yi ) must be the 
vertices of a quadrilateral, since any degeneracy will lead 
to a simplex and hence an immediate preparation non¬ 
contextual model by the above argument. As in fig. [I] we 
adopt the conventions that Vo is opposite to V 3 , and the 
Vo — V 3 and V 2 — V\ diagonals are positively oriented: 

x 0 x 3 X 2 X\ ^ q 

2/o - 2/3 2/2 - 2/i ' ' 

A pivotal equivalence. One example of an opera¬ 
tional equivalence is given by the point c at which the 
{Vo,V 3 } diagonal intersects {V \, V 2 } diagonal, giving 
probabilities p, q such that 

pV 0 + (1 - p)V 3 = qV 1 + (1 - q)V 2 = c. (6) 

Preparation noncontextuality then demands that 

PMo(A) + (1 - p)MA) = qp l(A) + (1 - q)p 2 (A). (7) 

I will now show that in the current scenario, this single 
equivalence is in fact sufficient for a model to be prepa¬ 
ration noncontextual. 

Suppose we have a preparation contextual model, i.e. 
there exists Pi,qi such that eq. <[2j) holds yet eq. ^ 


fails. We want to prove that eq. |7| must also fail. 
The first step is to show that eq. (|3| must fail for some 
Pi,q'i with Y.iV\Vi = Ei Qi'Pi = c. To see this, denote 
p = 'EiPiVi = E,: qiPii and notice that since c is in the 
interior of the quadrilateral c = E; r iV’i + r*p for some 
probability distribution {r 3 , tt, r 2 , r 3: r*} with r* > 0. 
But then p[ = Vi + r*pt and q[ = Ti + r*qt give the re¬ 
quired instance, with the failure of © ensured by the 
fact it fails for the pi, qi and r* > 0. 

Now I will argue that there exist probabilities s, t such 
that {Po,Pi,£> 2 , 7 * 3 } = { s P>( 1 - s )<b ( 1 - s )( 1 -9),s(l-p)} 
and {q , o,q'i,q , 2 ,q' 3 } = {tp, (1 -t)q, (l-t){l-q),t{l-p)}, 
i.e. that E,: PjE amounts to preparing pV 0 + (1 — p)V 2 
with probability s and qV 1 + (1 — q)V 2 with probability 
(1 — s) (similarly for the q[ with t in place of s ). If that is 
the case then 0 implies E iPiPiW = E i <&PiW and so 
the failure of the latter requires the failure of the former, 
which is what we wanted to prove. 

To find s and t it is useful to make an affine transfor¬ 
mation to a new co-ordinate system in which c = (0, 0), 
Vo = (1 —p, 0) and Vi = (0,1 —q). ThenpV 0 +(l-p)V 3 = 
c gives V 3 = (—p, 0) and qV± + (1 — q)V 2 = c gives 
V 2 = (0, —q). Now Y.iPi'Pi = c becomes (p' 0 ( 1 — p)— 
P 3 P,p'i{ 1 ~ q) - P' 2 q) = (0,0). Defining s = p' 0 /p > 0 
we have p' 3 = s(l — p). Similarly defining s = Pi/q > 0 
we have p' 2 = s(l — q). E iPi = 1 gives s + s = 1, and 
repeating the same argument with the q[ gives t. 

The connection to CHSH. It was first shown by 
Barrett that the existence of a preparation noncontex¬ 
tual model for a single system implies the existence of 
a locally causal model for any bipartite scenario involv¬ 
ing that system |12j . in other words, any bipartite proof 
of Bell’s theorem is a proof of preparation contextual- 
ity. The converse is not expected to hold in general (al¬ 
though certain proofs of preparation contextuality can 
be converted into bipartite proofs of Bell’s theorem El). 
Nevertheless, with the reduction of the previous section 
in hand, Barrett’s argument can be extended to see that 
the existence of a preparation noncontextual model for 
four preparations and two binary measurements is equiv¬ 
alent to the existence of a Bell local model in the scenario 
considered by CHSH [TT| . 

In the relevant Bell scenario two parties choose be¬ 
tween two measurements, their choices labelled a and 
b. They both obtain a binary outcome, labelled x and y. 
Their statistics P(o, b\x, y) are related to the preparation 
noncontextuality scenario by 


/P(0,fc|0,j) P(l,k\0,j)\ 

V (0, k\l,j) P(l,k\l,j)J 

_ fpP(k\Vo,Mj) (l-p)P(k\V 3 ,Mj)\ 
\qP{k\Pi, Mj) (l-q)P(k\V 2 ,M j )J ■ 

These statistics are normalised, and are no-signalling 
due to the operational equivalence (|6|. If we have 
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a preparation noncontextual model, then set p(X) = 
PPo{ A) + (1 -p)n 3 (\), 


(Pa( 0|A,0) Pa(1|A,0)\ 

_ (pT o(A)MA) (1 -p)p 3 (X)/n(X)\ ,, 

- Vr(A)MA) (l-g)M2(A)/MA)J W 

(which is normalised by eq. ([2])), and P B (k\\,j) = 
P(k\\,Mj). Then eq. |l]) gives the locally causal model 


Proof. Denoting the LHS of ([5| by D, Cramer’s rule gives 


x 2 - x 3 x 2 ~- X\ 
Vi -2/3 2/2 - 2/i 
Xq - x 3 x 2 - x 3 
2/o ^ 2/3 2/2 - 2/3 


/D, 


ID. 


(13) 

(14) 


Substituting these into © and multiplying through by 
D > 0 gives, upon expanding all the determinants, (12). 
The reader can avoid an algebraic quagmire by referring 
to the Mathematica notebook provided on the arXiv. □ 


p(a, b\x, V) = j Pa(<A A, x)P B (b\\, y)p(X)d\. (10) 

If, conversely, we start with a locally causal model, 
then inverting eq. ([9]) gives an ontological model for 
P{k\Vi, Mj), where eq. (J?]) is guaranteed by the normal¬ 
isation of the locally causal model. Since we have seen 
that eq. 0 is sufficient for a preparation noncontextual 
model, we have established the desired equivalence. 

Fine [15] has shown that the eight versions of the 
CHSH inequality m are necessary and sufficient for 
the existence of a locally causal model. Hence given 
P{k\T’i 1 Mj), one can calculate the corresponding Bell 
scenario probabilities (Jsj) and then use the eight CHSH 
inequalities to determine whether or not a preparation 
noncontextual model exists. 

A closed expression. This above argument com¬ 
pletely characterises the noncontextual statistics in our 
scenario. However, it might appear that if one were to 
calculate p and q explicitly and substitute ([8]) into the 
CHSH inequalities the result would be extremely convo¬ 
luted. In fact it can be written in a remarkably simple 
form, thanks to following lemma. 

Lemma 1. Suppose Vi = (xt, yt) for i = 0,1, 2,3 satisfy 
eq. 0, and p and q are defined as the solutions of eq. 0. 
Then for any real numbers {z 3 , z 3 , z 2 , z 3 }, 

pz 0 + (1 - p)z 3 < qz\ + (1 - q)z 2 (11) 


If we substitute eq. ([8]) into the CHSH inequalities we 
obtain (11) with z 3 = CqXq + do2/o — 1, z\ = —c\X\ — 
di 2 /i + 1, z 2 = cix 2 + dxy -2 + 1, z 3 = -c 0 x 3 - d 0 y 3 - 1, 
where (cq, do, c\, d\) is a column of 


/ 1 1 1 1 -1 -1 -1 -l\ 

1 1-1-11 1-1-1 
1-11-11-11-1 
\-l 1 1-11-1-11/ 


(15) 


(There is one column for each version of the CHSH in¬ 
equality.) Substituting the Zi into (12) then gives the 
desired inequality. For example the (A 0 Bo) + (A 0 B 1 ) + 
(AiB 0 ) — (A 1 B 1 ) < 2 version of the CHSH inequality cor¬ 
responds to the first column of (15), and (12) becomes 


xo Vo x 0 + yo~l 1 
X\ 2/1 - 2 d + 2 /i + 1 1 
x 2 2/2 x 2 - 2/2 + 1 1 

2-3 2/3 -2:3 - 2/3 - 1 1 


(16) 


(Subject to ([5]), this can still serve as a Bell inequality, 
but now in terms of p{a\b, x, y) rather than p(a, b\x, y).) 

Quantum violation. Suppose A4o and M\ corre¬ 
spond to X and Z measurements of a qubit. Let V 3 cor¬ 
respond to preparing the +1 eigenstate of (X + Z)/\J 2 
and V 3 the —1 eigenstate. Similarly let {V\,V 2 } be the 
{+1, —1} eigenstates of (X — Z)/\/2. Denoting?; = 1 / v/2 


the LHS of (16) is 


if and only if 


xo 2/0 z 0 1 

xi yi zi 1 

x 2 2/2 Z 2 1 
2^3 2/3 z 3 1 


< 0. 


( 12 ) 


Furthermore, equality in and (fl2| is also equivalent. 


Geometrically this lemma concerns a tetrahedron with 
vertices ( Xi,yt,Zi ). Equation (11) asks whether, when 


the 0-3 edge meets the 1-2 edge in the {x, y)-plane, it is 
below in the ^-direction. Subject to the convention |5| 
that is equivalent to the statement © about the signed 
volume of the tetrahedron. A purely algebraic proof can 
be given as follows. 


v v 2v — 1 1 
v —v 1 — 2v 1 
—v v 1 — 2v 1 
—v —v 2v — 1 1 


= 16n 2 (2n - 1) « 3.31. (17) 


This is the same proof of the preparation contextuality 
of a qubit that appeared in [8]. However, that proof as¬ 
sumed \V\ + \V 3 = fV\ + \V 2 , which will never hold 
exactly in a realistic experiment. Since no such assump¬ 


tion entered into (161, the proof presented here is more 


experimentally robust. 

Tomographic completeness. A new difficulty arises 
in the present approach, namely our assumption that 
Mo and M\ are tomographically complete for the Vi- 
An obvious objection in the qubit example of the pre¬ 
vious section is the Y measurement. Suppose we have 
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observed failures of the latter can be dealt with using 
convexity, non-classicality may be left as the only plau¬ 
sible explanation. 

The main direction for future theoretical work is to 
classify scenarios with more preparations and measure¬ 
ments. Failures of tomographic completeness could then 
be dealt more elegantly than above, by fully incorporat¬ 
ing the extra procedures into the contextuality scenario. 
It would also be interesting to apply similar ideas to mea¬ 
surement and transformation noncontextuality. 


FIG. 2. Dealing with failure of tomographic completeness. 
The statistics for three binary measurements {.Mo, Mi, M'} 
are defined by a 3-dimensional vector for each preparation. 
Two views of this space are shown. The four non-planar 
preparations in brown, together with additional preparation 
in blue, imply the existence of the four planar preparations 
in green. The statistics for the green preparations can be cal¬ 
culated with simple trigonometry and then tested against the 
inequalities derived here for two binary measurements. 


a preparation noncontextual model for three measure¬ 
ments {A4o, A4i, A4'}, and consider a set of preparations 
that all give the same probability for A4'. Since eq. ([2j) 
holds for all three measurements if and only if it holds for 
{A4o, A4i}, the problem reduces to finding a preparation 
noncontextual model for A4o and M x . Since the quan¬ 
tum states mentioned above all give uniformly random 
outcomes for the Y measurement, there is in principle 
nothing wrong with using (16). 

In practice, however, the preparations in a real experi¬ 
ment will each give slightly different probabilities for the 
Y measurement. The simplest way to deal with this is to 
add a fifth preparation that gives one outcome of the Y 
measurement with high probability (e.g. the +1 eigen¬ 
state of Y). If, as in fig. |2j we then consider a plane per¬ 
pendicular to the Y axis with the four original prepara¬ 
tions on one side and the new preparations on the other, 
convexity implies the existence of, and determines the 
probabilities for, four preparations in the plane that can 
then be tested against (16). Notice there is no need to ac¬ 
tually implement the four new preparations. Any further 
measurements that reveal small amounts of information 
about the preparations can be dealt with similarly. 

This idea has been generalised m into a technique for 
identifying experimental violations of other noncontextu¬ 
ality inequalities, even those requiring fixed equivalences. 

Conclusions. The eight inequalities derived here fully 
classify the preparation noncontextual statistics in the 
simplest non-trivial scenario. No assumptions of de¬ 
terminism or unattainable operational equivalences were 
made. No new assumptions on the representation of ap¬ 
proximately operationally equivalent procedures pang 
were made either. Hence a violation can only be ex¬ 
plained by a failure of noncontextuality or a failure of the 
tomographic completeness of the measurements. Since 
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